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(57) A publish/subscribe data processing broker ap- 
paratus has: a means for receiving from a publisher ap- 
plication data messages published on a subject-specific 
stream by the publisher application; and a means for dis- 
tributing the received published data messages to a 
subscriber application which has requested to receive 
messages on the stream upon which the published mes- 
sages were published, the means for distributing includ- 
ing a plurality of connected distribution agent data 
processing units; wherein one of the streams is reserved 
for messages published by a publishing distribution 
agent data processing unit and intended for receipt by 
a subscribing distribution agent data processing unit lo- 
cated adjacent the publishing distribution agent data 
processing unit or intended for receipt by a subscriber 
application located adjacent the publishing distribution 
agent data processing unit. 
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Description 

Field of the Invention 

[0001] The present invention relates to the field of da- 
ta processing and more specifically to data processing 
which distributes messages from suppliers (called, 
hereinafter, "publishers") of data messages to consum- 
ers (called, hereinafter "subscribers") of such messag- 
es. 

Background of the Invention 

[0002] Publish/subscribe data processing systems 
have become very popular in recent years as a way of 
distributing data messages from publishing computers 
to subscribing computers. The increasing popularity of 
the Internet, which has connected a wide variety of com- 
puters all over the world, has helped to make such pub- 
lish/subscribe systems even more popular. Using the In- 
ternet, a World Wide Web browser application (the term 
"application" or "process" refers to a software program, 
or portion thereof, running on a computer) can be used 
in conjunction with the publisher or subscriber in order 
to graphically display messages. Such systems are es- 
pecially useful where data supplied by a publisher is 
constantly changing and a large number of subscribers 
needs to be quickly updated with the latest data. Per- 
haps the best example of where this is useful is in the 
distribution of stock market data. 
[0003] In such systems, publisher applications of data 
messages do not need to know the identity or location 
of the subscriber applications which will receive the 
messages. The publishers need only connect to a pub- 
lish/subscribe distribution agent process, which is in- 
cluded in a group of such processes making up a broker 
system (referred to hereafter as a broker), and send 
messages to the distribution agent process, specifying 
the subject of the message to the distribution agent 
process. The distribution agent process then distributes 
the published messages to subscriber applications 
which have previously indicated to the broker that they 
would like to receive data messages on particular sub- 
jects. Thus, the subscribers also do not need to know 
the identity or location of the publishers. The subscribers 
need only connect to a distribution agent process. 
[0004] One such publish/subscribe system which is 
currently in use is shown in Fig. 1. Publishers 11 and 12 
connect to the publish/subscribe broker 2 and send pub- 
lished messages to broker 2 which distributes the mes- 
sages to subscribers 31, 32, 33, 34. Publishers 11 and 
1 2, which are data processing applications which output 
data messages, connect to broker 2 using the well 
known inter-application data connection protocol known 
as remote procedure call (or RPC). Each publisher ap- 
plication could be running on a separate machine, alter- 
natively, a single machine could be running a plurality 
of publisher applications. The broker 2 is made up of a 



plurality of distribution agents (21 through 27) which are 
connected in a hierarchial fashion which will be de- 
scribed below as a "tree structure". These distribution 
agents, each of which could be running on a separate 
machine, are data processing applications which distrib- 
ute data messages through the broker 2 from publishers 
to subscribers. Subscriber applications 31, 32, 33 and 
34 connect to the broker 2 via RPC in order to receive 
published messages. 

[0005] Publishers 1 1 and 1 2 first connect via RPC di- 
rectly to a root distribution agent 21 which in turn con- 
nects via RPC to second level distribution agents 22 and 
23 which in turn connect via RPC to third level distribu- 
tion agents 24, 25, 26 and 27 (also known as "leaf dis- 
tribution agents" since they are the final distribution 
agents in the tree structure). Each distribution agent 
could be running on its own machine, or alternatively, 
groups of distribution agents could be running on the 
same machine. The leaf distribution agents connect via 
RPC to subscriber applications 31 through 34, each of 
which could be running on its own machine. 
[0006] In order to allow the broker 2 to determine 
which published messages should be sent to which sub- 
scribers, publishers provide the root distribution agent 
21 with the name of a distribution stream for each pub- 
lished .message. A distribution stream (called hereinaf- 
ter a "stream") is an ordered sequence of messages 
having a name (e.g., "stock" for a stream of stock market 
quotes) to distinguish the stream from other streams. 
Likewise, subscribers provide the leaf distribution 
agents 31 through 34 with the name of the streams to 
which they would like to subscribe. In this way, the bro- 
ker 2 keeps track of which subscribers are interested in 
which streams so that when publishers publish messag- 
es to such streams, the messages can be distributed to 
the corresponding subscribers. Subscribers are also al- 
lowed to provide filter expressions to the broker in order 
to limit the messages which will be received on a partic- 
ular stream (e.g., a subscriber 31 interested in only IBM 
stock quotes could subscribe to the stream "stock" by 
making an RPC call to leaf distribution agent 24 and in- 
clude a filter expression stating that only messages on 
the "stock" stream relating to IBM stock should be sent 
to subscriber 31). 

[0007] Oftentimes, it is necessary for a distribution 
agent to inform its "children" (i.e. , the distribution agents 
directly underneath the distribution agent) of some 
event, so that the children can take some appropriate 
action. For example, if distribution agent 22 should lose 
its connection to its parent (the root distribution agent 
21), distribution agent 22 needs to inform its children 
distribution agents 24 and 25 so that such children can 
take an appropriate action which has been pre-config- 
ured by the systems administrator (e.g., the distribution 
agents 24 and 25 could either try to connect to another 
parent, such as distribution agent 23, or they could sim- 
ply stay with distribution agent 22 and wait until distri- 
bution agent 22 regains its connection with the root dis- 
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tribution agent 21). 

[0008] In the prior art publish/subscribe broker, this 
type of communication between a parent distribution 
agent and its children has involved the parent having to 
send a dedicated command, outside of the normal pub- 5 
lish/subscribe message flow. This increases the types 
of traffic flowing between distribution agents. Further, 
one child may receive messages in a different order as 
compared to one of its siblings, due to the fact that the 
parent must send separate commands to each child. Ac- 10 
cordingly, the prior state of the art in this area has re- 
suited in an inefficient use of available resources and a 
generally unsatisfactory architectural approach. 

Disclosure of the Invention * 5 

[0009] According to one aspect, the present invention 
provides a publish/subscribe data processing broker ap- 
paratus having: a means for receiving from a publisher 
application data messages published on a subject-spe- 20 
cific stream by the publisher application; and a means 
for distributing the received published data messages 
to a subscriber application which has requested to re- 
ceive messages on the stream upon which the pub- 
lished messages were published, the means for distrib- 25 
uting including a plurality of connected distribution agent 
data processing units; wherein one of the streams is re- 
served for messages published by a publishing distribu- 
tion agent data processing unit and intended for receipt 
by a subscribing distribution agent data processing unit 30 
located adjacent the publishing distribution agent data 
processing unit or intended for receipt by a subscriber 
application located adjacent the publishing distribution 
agent data processing unit. 

[0010] According to a second aspect, the present in- 3S 
vent ion provides a publish/subscribe data processing 
broker apparatus having: a means for receiving from a 
publisher application data messages published on a 
subject-specific stream by the publisher application; and 
a means for distributing the received published data 40 
messages to a subscriber application which has re- 
quested to receive messages on the stream upon which 
the published messages were published, the means for 
distributing including a plurality of distribution agent data 
processing units arranged in a hierarchial structure in- 45 
eluding a plurality of ordered levels including a top level 
closest to publisher applications and a bottom level clos- 
est to subscriber applications; wherein one of the 
streams is reserved for messages published by a pub- 
lishing distribution agent data processing unit located at 50 
one level of the hierarchy and intended for receipt by a 
subscribing distribution agent data processing unit lo- 
cated at a neighbouring level of the hierarchy or intend- 
ed for receipt by a subscriber application if the publish- 
ing distribution agent data processing unit is located at 55 
the bottom level of the hierarchy. 
[0011] Preferably, upon receipt of a message pub- 
lished on the reserved stream, the subscribing distribu- 



tion agent data processing unit or subscriber application 
takes a predetermined action depending on the con- 
tents of the received published message. Further pref- 
erably, at least one of the publisher application or the 
subscriber application makes use of an Internet World 
Wide Web browser program to allow published messag- 
es to be sent or received over the Internet. 
[0012] The present invention reuses the existing 
stream architecture in order to enable communication 
between neighbouring distribution agents (and between 
a distribution agent and its directly connected subscrib- 
ers). Therefore, it is not necessary to use a separate 
communication type for such communication, resulting 
in a highly efficient use of available system resources. 
[0013] Further, data messages from, for example, a 
parent distribution agent to its children are always de- 
livered in order on a stream thus ensuring that all such 
messages are received in the exact same order by each 
such child, thus serving to provide a high degree of sta- 
bility and predictability of result. 
[0014] According to a third and a fourth aspect, the 
present invention provides a data processing method 
having method steps corresponding to each element of 
the data processing apparatus of the first and second 
aspect, respectively, of the invention. 
[0015] According to a fifth and sixth aspect, the 
present invention provides a computer readable storage 
medium having a computer program stored on it which, 
when executed on a computer, carries out the function- 
ality of data processing method of the third and fourth 
aspect, respectively, of the invention. 

Brief Description of the Drawings 

[0016] The invention will be better understood by re- 
ferring to the detailed description of the preferred em- 
bodiments which will now be described in conjunction 
with the following drawing figures: 

Figure 1 shows the architecture of the publish/sub- 
scribe system to which the preferred embodiment 
of the present invention relates; 

Figure 2 is a flowchart showing the steps taken by 
a parent distribution agent, according to a preferred 
embodiment of the present invention; 

Figure 3 is a flowchart showing the steps taken by 
children distribution agents, according to a pre- 
ferred embodiment of the present invention; and 

Figure 4 shows an alternate version of the broker 
architecture of Fig. 1 after the steps in the flowcharts 
of Figs. 2 and 3 are executed. 

Detailed Description of the Preferred Embodiments 

[001 7] In Fig. 1 a publisher application 1 1 , running on 
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one computer, is, for example, a supplier of live stock 
market data quotes. That is, publisher application 11 
provides up to the minute messages stating the present 
value of share prices. In this example, publisher appli- 
cation 11 is publishing messages on a stream called 
"stock" which has already been configured in the broker 
2. As is well known, when publisher 11 wishes to publish 
a stock quote message to stream "stock", publisher 11 
makes an RPC call to the root distribution agent 11 
which is at the top level of the broker tree structure. In 
this example, subscriber application 32, running on an- 
other computer, has sent a subscription request via an 
RPC call to leaf distribution agent 24, which is at the 
bottom level of the tree structure, indicating that sub- 
scriber 32 would like to subscribe to stream "stock". 
[0016] Thus, whenever publisher 11 publishes a data 
message to stream "stock" the distribution tree structure 
of broker 2 channels the message down through the root 
distribution agent 21 , through any intermediary distribu- 
tion agents (e.g., 22 in the example of Fig. 1) and 
through the leaf distribution agent 24 to the subscriber 
32. This involves a series of RPC calls being made be- 
tween each successive circle in the diagram of Fig. 1 
connecting publisher 11 and subscriber 32 (i.e., 11 to 
21 , 21 to 22, 22 to 24 and 24 to 32). 
[001 9] This, so far, describes the operation of the prior 
art publish/subscribe broker. The preferred embodiment 
of the present invention will now be described. 
[0020] If distribution agent 22 has lost its connection 
to its parent (i.e., root distribution agent 21), then this 
means that the distribution agents 24 and 25, which are 
children of distribution agent 22, will not be able to re- 
ceive any further data messages until distribution agent 
22 regains its connection to root distribution agent 21. 
Of course, this also means that subscribers 31 and 32 
connected to distribution agent 24 (and any subscribers 
connected to distribution agent 25) also will not receive 
any data messages. Distribution agent 22 must inform 
its children 24 and 25 of the connection loss so that the 
children can take an appropriate action (such as con- 
necting to a sibling of distribution agent 22 such as dis- 
tribution agent 23). The specifics of which appropriate 
action should be taken by the children will have been 
specified in advance by the systems administrator as a 
preconfigured policy. 

[0021] The operational steps taken by distribution 
agent 22, according to a preferred embodiment of the 
present invention, are illustrated by the flowchart of Fig. 
2. 

[0022] At step 201, distribution agent 22 detects the 
fact that it has lost its connection to its parent (the root 
distribution agent 21). At step 202, distribution agent 22 
publishes a message on a stream called "infostream" 
which is reserved for use in communicating information 
between a parent distribution agent and its children. On- 
ly distribution agents can act as publishers on this 
stream, and a security check is made to determine 
whether a publisher is a distribution agent before a pub- 
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lisher is allowed to publish on this stream. Whenever a 
new child (be it a distribution agent or a subscriber) con- 
nects to a distribution agent (which is that child's parent) 
the child subscribes to the stream "infostream", so that 
5 the child can easily receive messages from its parent 
distribution agent. The message which distribution 
agent 22 publishes at step 202 informs the agent 22's 
children 24 and 25 of the fact that agent 22 has lost its 
. connection to agent 21. 
10 [0023] The operational steps taken by children distri- 
bution agents 24 and 25 upon receiving the message 
published by parent distribution agent 22 at step 202, 
according to a preferred embodiment of the present in- 
vention, are illustrated by the flowchart of Fig. 3. 
is [0024] At step 301 , the children distribution agents 24 
and 25 receive the message published by parent distri- 
bution agent 22. This is carried out by the parent distri- 
bution agent 22 initiating an RPC call to each child dis- 
tribution agent 24 and 25. The children distribution 
agents have previously subscribed to the stream "infos- 
tream" when they first joined the tree as children of dis- 
tribution agent 22. At step 302, the children distribution 
agents 24 and 25 follow a preconfigured policy set by 
the systems administrator in order to inform the agents 
24 and 25 as to what to do if they receive a message on 
stream "infostream" from their parent telling them that 
the parent has lost its connection to its parent. In this 
example, the preconfigured policy is that in this case the 
children distribution agents should connect to an alter- 
nate parent 23 which is a sibling of the original parent 
22 which has lost its connection to its parent 21 . Thus, 
agents 24 and 25 make an RPC call to agent 23 (which 
will in turn make an RPC call to root distribution agent 
21) requesting that any published messages which 
would have been sent to agents 24 and 25 via agent 22 
should now be sent via agent 23. A modified version of 
the publish/subscribe broker 2's tree structure illustrat- 
ing this change is shown in Fig. 4. 
[0025] It should be noted that an alternative policy 
which the children could follow after step 301 would be 
to simply wait until the parent agent 22 has regained its 
connection to root agent 21 . However, this policy results 
in subscribers 31 and 32 experiencing a delay in receiv- 
ing messages while the connection loss exists. When 
the connection is again established, the messages pub- 
lished while the connection was down can be retrieved 
from persistent storage (maintained by root agent 21) 
and then sent to subscribers 31 and 32 via agents 22 
and 24. 

[0026] In the above example of the use of the stream 
"infostream" to communicate between a parent and its 
children, a parent sent a message to each of its children. 
As with other streams in the distribution tree, a parent 
can also select which of its children to send a particular 
message to on stream "infostream" depending on the 
nature of the message being sent. For example, another 
use of the stream "infostream" is where a parent distri- 
bution agent 24 wishes to inform a subscriber 31 (which 
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Is a child of agent 24) that the subscriber 31 has lost its 
authority to receive messages on a stream (e.g., the 
"stock" stream). This could be, for example, where a 
company has reorganized and made changes as to who 
is allowed to view certain data. On the other hand, the 
subscriber 32 has not lost its authority to receive mes- 
sages on this stream. Thus, the parent 24 publishes a 
message to only subscriber 31 via RPC on stream "in- 
f ostream" with a message content that informs subscrib- 
er 31 that subscriber 31 is no longer authorized to re- 
ceive messages on the stream (e.g., the "stock" 
stream). In this case, there is no particular response 
which the subscriber must take upon receiving this mes- 
sage. However, the subscriber 31 is clearly informed 
that it is not to expect any more messages on the "stock" 
stream. 

[0027] A still further use of the "infostream" stream is 
in a load balancing situation. Assume a leaf distribution 
agent 24 is very busy sending messages to one of its 
children (subscriber 31 ), that is, there is a heavy load of 
messages going to subscriber 31 . Parent distribution 
agent 24 could publish a message on stream "infos- 
tream" telling its other child (subscriber 32) to connect 
to distribution agent 25 temporarily during the period of 
heavy traffic to subscriber 32. Subscriber 32 would then 
connect to agent 25 by making an RPC call thereto. At 
a later time, should agent 25 become overburdened due 
to the extra load, agent 25 can publish a message on 
stream "infostream" informing subscriber 32 to recon- 
nect to its original parent 24. Subscriber 32, in response 
to this published message, would then reconnect to its 
original parent 24 via an RPC call thereto. 
[0028] While a hierarchial tree structure of distribution 
agent processes has been described in the preferred 
embodiment for illustrative purposes, many other archi- 
tectures are included within the scope of the invention. 
The invention can also be used in the context of a group 
of cooperating processes spanning different types of 
networks (including local area and wide area networks), 
different types of machines and using different inter- 
process communication protocols (i.e., RPCs, sockets, 
TCP/IP or any other well known inter-process commu- 
nication protocol). For example, the invention can also 
be used in the environment of the Internet, which in- 
volves a plurality of different network types connecting 
different types of machines. In embodiments which do 
not use the tree structure, the "infostream" is used to 
pass information not between parent and child process- 
es but more broadly information is passed between 
neighbouring processes (i.e., processes which are ad- 
jacent to each other and communicate directly with each 
other using a well-known inter-process communication 
protocol without going through an intermediary proc- 
ess). 



Claims 

1. A publish/subscribe data processing broker appa- 
ratus comprising: 

5 

means for receiving from a publisher applica- 
tion data messages published on a subject- 
specific stream by the publisher application; 
and 

10 

means for distributing the received published 
data messages to a subscriber application 
which has requested to receive messages on 
the stream upon which the published messag- 
es es were published, the means lor distributing 
including a plurality of connected distribution 
agent data processing units; 

wherein one of the streams is reserved for mes- 
20 sages published by a publishing distribution 

agent data processing unit and intended for re- 
ceipt by a subscribing distribution agent data 
processing unit located adjacent the publishing 
distribution agent data processing unit or in- 
25 tended for receipt by a subscriber application 

located adjacent the publishing distribution 
agent data processing unit. 

2. A publish/subscribe data processing broker appa- 
30 ratus comprising: 

means for receiving from a publisher applica- 
tion data messages published on a subject- 
specific stream by the publisher application; 
35 and 

means for distributing the received published 
data messages to a subscriber application 
which has requested to receive messages on 
40 the stream upon which the published messag- 

es were published, the means for distributing 
including a plurality of distribution agent data 
processing units arranged in a hierarchial struc- 
ture including a plurality of ordered levels in- 
cluding a top level closest to publisher applica- 
tions and a bottom level closest to subscriber 
applications; 



wherein one of the streams is reserved for mes- 
50 sages published by a publishing distribution 

agent data processing unit located at one level 
of the hierarchy and intended for receipt by a 
subscribing distribution agent data processing 
unit located at a neighbouring level of the hier- 
55 archy or intended for receipt by a subscriber ap- 

plication if the publishing distribution agent data 
processing unit is located at the bottom level of 
the hierarchy. 
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3. The apparatus of claim 1 wherein upon receipt of a 
message published on the reserved stream, the 
subscribing distribution agent data processing unit 
or subscriber application takes a predetermined ac- 
tion depending on the contents of the received pub- 
lished message. 

4. The apparatus of claim 1 wherein at least one of the 
publisher application or the subscriber application 
makes use of an Internet World Wide Web browser 
program to allow published messages to be sent or 
received over the Internet. 

5. A publish/subscribe data processing broker method 
comprising steps of: 

means for receiving from a publisher applica- 
tion data messages published on a subject- 
specific stream by the publisher application; 
and 

means for distributing the received published 
data messages to a subscriber application 
which has requested to receive messages on 
the stream upon which the published messag- 
es were published, the means for distributing 
including a plurality of connected distribution 
agent data processing units; 

wherein one of the streams is reserved for mes- 
sages published by a publishing distribution 
agent data processing unit and intended for re- 
ceipt by a subscribing distribution agent data 
processing unit located adjacent the publishing 
distribution agent data processing unit or in- 
tended for receipt by a subscriber application 
located adjacent the publishing distribution 
agent data processing unit. 

6. A publish/subscribe data processing broker method 
comprising steps of: 

receiving from a publisher application data 
messages published on a subject-specific 
stream by the publisher application; and 

distributing the received published data mes- 
sages to a subscriber application which has re- 
quested to receive messages on the stream up- 
on which the published messages were pub- 
lished, the means for distributing including a 
plurality of distribution agent data processing 
units arranged in a hierarchial structure includ- 
ing a plurality of ordered levels including a top 
level closest to publisher applications and a 
bottom level closest to subscriber applications; 

wherein one of the streams is reserved for mes- 



sages published by a publishing distribution 
agent data processing unit located at one level 
of the hierarchy and intended for receipt by a 
subscribing distribution agent data processing 
5 unit located at a neighbouring level of the hier- 

archy or intended for receipt by a subscriber ap- 
plication if the publishing distribution agent data 
processing unit is located at the bottom level of 
the hierarchy. 

10 

7. A computer program product stored on a computer 
readable storage medium for, when executed by a 
computer, carrying out a publish/subscribe data 
processing broker method comprising steps of: 

15 

means for receiving from a publisher applica- 
tion data messages published on a subject- 
specific stream by the publisher application; 
and 

20 

means for distributing the received published 
data messages to a subscriber application 
which has requested to receive messages on 
the stream upon which the published messag- 
es es were published, the means for distributing 
including a plurality of connected distribution 
agent data processing units; 

wherein one of the streams is reserved for mes- 
30 sages published by a publishing distribution 

agent data processing unit and intended for re- 
ceipt by a subscribing distribution agent data 
processing unit located adjacent the publishing 
distribution agent data . processing unit or in- 
35 tended for receipt by a subscriber application 

located adjacent the publishing distribution 
agent data processing unit. 

8. A computer program product stored on a computer 
40 readable storage medium for, when executed by a 

computer, carrying out a publish/subscribe data 
processing broker method comprising steps of: 

receiving from a publisher application data 
45 messages published on a subject-specific 

stream by the publisher application; and 

distributing the received published data mes- 
sages to a subscriber application which has re- 
50 quested to receive messages on the stream up- 

on which the published messages were pub- 
lished, the means for distributing including a 
plurality of distribution agent data processing 
units arranged in a hierarchial structure includ- 
es . ing a plurality of ordered levels including a top 
level closest to publisher applications and a 
bottom level closest to subscriber applications; 
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wherein one of the streams is reserved for mes- 
sages published by a publishing, distribution 
agent data processing unit located at one level 
of the hierarchy and intended for receipt by a 
subscribing distribution agent data processing 5 
unit located at a neighbouring level of the hier- 
archy or intended for receipt by a subscriber ap- 
plication if the publishing distribution agent data 
processing unit is iocated at the bottom level of 
the hierarchy. *o 
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